Goto

Collaborating Authors

 Barents Sea


A Systematic Study of Model Merging Techniques in Large Language Models

Hitit, Oğuz Kağan, Girrbach, Leander, Akata, Zeynep

arXiv.org Artificial Intelligence

Model merging combines multiple fine-tuned checkpoints into a single model without additional training, offering an attractive approach to reusing models and efficiently improving performance. However, it remains unclear whether the advantages reported for smaller models and classifiers generalize to LLMs. We present a large-scale, systematic evaluation of six state-of-the-art merging methods, including recent subspace methods, across four open-weight LLMs, twelve fine-tuned checkpoints per base model, and sixteen standard LLM benchmarks. Evaluating through standardized benchmarks, we measure both the probability that a merged model outperforms the base model and relative gains over the best individual checkpoint. Our results show that the oldest and simplest method, Task Arithmetic, is the only approach that reliably yields performance gains on LLMs. Other interference-aware and subspace merging methods typically result in significant performance drops. Our findings indicate that current merging techniques do not directly transfer to modern LLMs. This motivates the design of LLM-specific merging algorithms and merging-aware fine-tuning methods. Code will be released upon acceptance of this paper.


Ask before you Build: Rethinking AI-for-Good in Human Trafficking Interventions

Nair, Pratheeksha, Lefebvre, Gabriel, Garrel, Sophia, Molamohammadi, Maryam, Rabbany, Reihaneh

arXiv.org Artificial Intelligence

AI for good initiatives often rely on the assumption that technical interventions can resolve complex social problems. In the context of human trafficking (HT), such techno-solutionism risks oversimplifying exploitation, reinforcing power imbalances and causing harm to the very communities AI claims to support. In this paper, we introduce the Radical Questioning (RQ) framework as a five step, pre-project ethical assessment tool to critically evaluate whether AI should be built at all, especially in domains involving marginalized populations and entrenched systemic injustice. RQ does not replace principles based ethics but precedes it, offering an upstream, deliberative space to confront assumptions, map power, and consider harms before design. Using a case study in AI for HT, we demonstrate how RQ reveals overlooked sociocultural complexities and guides us away from surveillance based interventions toward survivor empowerment tools. While developed in the context of HT, RQ's five step structure can generalize to other domains, though the specific questions must be contextual. This paper situates RQ within a broader AI ethics philosophy that challenges instrumentalist norms and centers relational, reflexive responsibility.


Computer Vision for Multimedia Geolocation in Human Trafficking Investigation: A Systematic Literature Review

Bamigbade, Opeyemi, Sheppard, John, Scanlon, Mark

arXiv.org Artificial Intelligence

The task of multimedia geolocation is becoming an increasingly essential component of the digital forensics toolkit to effectively combat human trafficking, child sexual exploitation, and other illegal acts. Typically, metadata-based geolocation information is stripped when multimedia content is shared via instant messaging and social media. The intricacy of geolocating, geotagging, or finding geographical clues in this content is often overly burdensome for investigators. Recent research has shown that contemporary advancements in artificial intelligence, specifically computer vision and deep learning, show significant promise towards expediting the multimedia geolocation task. This systematic literature review thoroughly examines the state-of-the-art leveraging computer vision techniques for multimedia geolocation and assesses their potential to expedite human trafficking investigation. This includes a comprehensive overview of the application of computer vision-based approaches to multimedia geolocation, identifies their applicability in combating human trafficking, and highlights the potential implications of enhanced multimedia geolocation for prosecuting human trafficking. 123 articles inform this systematic literature review. The findings suggest numerous potential paths for future impactful research on the subject.


Strategic Geosteeering Workflow with Uncertainty Quantification and Deep Learning: A Case Study on the Goliat Field

Rammay, Muzammil Hussain, Alyaev, Sergey, Larsen, David Selvåg, Bratvold, Reidar Brumer, Saint, Craig

arXiv.org Machine Learning

The real-time interpretation of the logging-while-drilling data allows us to estimate the positions and properties of the geological layers in an anisotropic subsurface environment. Robust real-time estimations capturing uncertainty can be very useful for efficient geosteering operations. However, the model errors in the prior conceptual geological models and forward simulation of the measurements can be significant factors in the unreliable estimations of the profiles of the geological layers. The model errors are specifically pronounced when using a deep-neural-network (DNN) approximation which we use to accelerate and parallelize the simulation of the measurements. This paper presents a practical workflow consisting of offline and online phases. The offline phase includes DNN training and building of an uncertain prior near-well geo-model. The online phase uses the flexible iterative ensemble smoother (FlexIES) to perform real-time assimilation of extra-deep electromagnetic data accounting for the model errors in the approximate DNN model. We demonstrate the proposed workflow on a case study for a historic well in the Goliat Field (Barents Sea). The median of our probabilistic estimation is on-par with proprietary inversion despite the approximate DNN model and regardless of the number of layers in the chosen prior. By estimating the model errors, FlexIES automatically quantifies the uncertainty in the layers' boundaries and resistivities, which is not standard for proprietary inversion.


AI Is Helping Us Combat The Economic Problem Of Human Trafficking

#artificialintelligence

When we think of human trafficking, we often think about the despondent faces of women and children who live in slums all over the world. What if human trafficking is much closer to home than we think? In 2019, Markie Dell, stood on the TEDx stage to recount her experience of being a domestic human trafficking victim. She was an awkward teenager who was groomed by a girl that she befriended at a birthday party. She was subsequently kidnapped, drugged, sexually violated, intimidated at gunpoint into dancing in strip clubs for an entire year.


Computer cashes in big at Texas Hold 'Em tourney

AITopics Original Links

One of the proving grounds for artificial intelligence is games. Classic games have a fixed set of rules, and these make it easier for researchers to develop new techniques and algorithms that enable computers to play (and hopefully win) various games. Tic-tac-toe, checkers, and chess are all games where researchers have developed software that is capable of winning or drawing when paired off against the best human players in the world. Last weekend, researchers at the University of Alberta added another classic game to this list: poker. In a series of matches that took place over the Fourth of July weekend in Las Vegas, the researchers' Polaris poker program won against a group of top-ranked online poker players.